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NOVEL GENES INVOLVED IN THE Escherichia coli BIOFILM FORMATION AND 
USES THERE OF 

CROSS-REFERENCE TO RELATED APPLICATIONS 

[0001] This application claims priority to and the benefit of the filing date of U.S. Provisional 
Patent Application entitled " NOVEL GENES INVOLVED IN THE Escherichia coli BIOFILM 
FORMATION AND USES THERE OF," assigned Ser. No. 60/464,062, and filed April 17, 
2003, the text of which is hereby incorporated by reference. 

FIELD OF THE INVENTION 

[0002] This invention relates to the identification of genes involved in biofilm formation, 
methods of identifying compounds that modulate the activity of biofilm formation and/or 
effectively regulate the level of expression of biofilm formation, and to compounds so identified. 

BACKGROUND OF THE INVENTION 

[0003] Microorganisms commonly attach to living and nonliving surfaces, including those of 
indwelling medical devices, and form biofilms made up of extracellular polymers. In this state, 
microorganisms are highly resistant to antimicrobial treatment and are tenaciously bound to the 
surface. Biofilms represent a distinct physiological state, designed to provide a protected 
environment for survival under hostile conditions. Costerton, J.W. et al., Science, 284: 1318- 
1322 (1999). Many chronic infections that are difficult or impossible to eliminate with 
conventional antibiotic therapies are known to involve biofilms. A partial list of the infections 
that involve biofilms included: otitis media, prostatitis, vascular endocarditis, cystic fibrosis 
pneumonia, meliodosis, necrotizing faciitis, osteomyelitis, peridontitis, biliary tract infection, 
struvite kidney stone and host of nosocomial infections. Costerton, J.W., et al., Science, 284: 
1318-1388 (1999). 



[0004] As awareness of biofilm significance increases so does the need to understand the genetic 
mechanisms involved in its formation. Research using molecular techniques has been successful 
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in gaining some insight to this process. Gene fusion experiments have been used to observe the 
changes in gene expression during biofilm development (Lamont, et. al., 1999). Expression 
patterns of a series of genes during biofilm formation have been studied by using DNA arrays 
(Stanley, et. al., 2003; Whiteley et. al., 2001). Libraries of bacterial mutants created by 
transposon mutagenesis have been analyzed for their effects on biofilm development (Kolter et. 
at., 1998). Many of these studies take advantage of a straightforward microtiter plate assay for 
biofilm development (Huber, et. al., 2002). Formation of biofilm in the wells of a 
polyvlnylchloride (PVC) microtiter plate is visualized by crystal violet staining. This technique 
allows for the examination of many independently generated biofilms at one time (Danese, et. 
al., 2001) As a result of research efforts, important aspects of biofilm development are being 
revealed. It is now known that signal molecules for cell-to-cell communication, or quorum 
sensing, are necessary for normal biofilm development in various species of bacteria (Kolter, et. 
al., 1998). While it is recognized that flagellum mediated motility enables the bacterium to 
reach potential surface sites in certain species, current studies also implicate flagella as playing a 
role in surface attachment (Pratt, et. al., 1998). Surfacing sensing is an adaptive strategy bacteria 
use in biofilm production. 

[0005] It has been demonstrated that cells physically sense a surface through inhibition of 
flagellar rotation (Alavi, et. al., 2001). Therefore, one of the first stimuli sensed by bacteria as 
they form biofilm is mechanical in nature. Other surface components, including fimbriae and 
exopolysaccharides, have been shown to help bacteria counteract the repelling electrostatic 
forces between the surface and the bacterial envelope (Prigent-Combaret, et. al., 2000). 
Extracellular proteins or carbohydrates are known to be required for interaction with inert 
surfaces. 

[0006] Many of these studies have taken place using the genetically well-characterized 
laboratory strain E. coli K12. Using random insertion mutagenesis, transduction and the 
previously described biofilm microtiter plate assay, strains were screened for defects in biofilm 
production as well as other phenotypic changes of interest (Genevaux, et. al, 1996; Genevaux, et. 
al., 1999; Pratt, et. al., 1998; Prigent-Combart, et. al., 1999). These methods have identified 
several important gene products involved in the biofilm development of this strain (Watnick, et. 
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al., 2000). It has been determined that while motility is important to biofilm development in the 
early stages, it actually antagonizes mature biofilm formation. Type I pili have been reported to 
be key structures involved in stable cell-to-cell or cell-to-surface interactions needed for biofilm 
formation (Pratt, et. al, 1998). Other gene products have been shown to be involved in adhesion 
as well. DsbA, which catalyses the folding of extracytoplasmic proteins, is involved in the 
production of fimbriae (Genevaux, et. al., 1999). A mutation in this gene greatly affects 
adhesion. The Cpx-signaling pathway has been shown to respond to physical changes caused by 
adhesion of E. coli to surfaces (Otto, et. al., 2002). Evidence indicates that the surface induced 
response of Cpx activates a variety of genes involved in enhancing cell-surface interactions. A 
mutation in the cpxR gene dramatically lowers the number of attached cells. Once colonization 
occurs at the surface, major changes occur in the pattern of gene expression. Using transposon 
mutants generated by a MudX transposon carrying a promoterless lacZ gene, one study 
demonstrated that many of the genes in E. coli were affected at the level of transcription upon 
surface colonization (Prigent-Combaret, et. al., 1999). The proU operon (encoding a glycine 
betaine transport system), ompC (a porin encoding gene), wca locus (involved in colanic acid 
synthesis), pepT (encoding tripeptidase T) and nikA (involved in the nickel transport system) all 
have increased expression in attached bacteria. However, the expression of the flagellin gene, 
fliC, as well as a protein containing 92 amino acids (/92) are reduced in sessile bacteria. Some 
of these changes were also seen when the bacteria were grown in high salt concentrations, 
indicating that an increase in osmolarity may trigger increased expression of some biofilm- 
induced genes. 

[0007] Recent studies have established that the global regulatory system Csr (carbon storage 
regulator) of E. coli regulates biofilm production and dispersal (Jackson et. al., 2002). The Csr 
system is a global regulatory process that influences gene expression post-transcriptionally 
(Romeo, 1998). The active element of this system is a RNA binding protein designated CsrA 
(Romeo, et. al., 1993). CsrA acts by binding to RNA transcripts, facilitating their decay. The 
Csr system has been shown to activate metabolic pathways needed for exponential growth while 
repressing the expression of stationary phase genes (Romeo, 1998). The effects of the Csr 
regulon are many including glycogen metabolism, gluconeogenesis and motility (Romeo, et. al., 
1993; Wei, et. al., 2001). A mutation in the gene encoding CsrA resulted in a significant increase 
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in biofilm formation (Jackson, et. al., 2002). A csrA mutant forms a biofilm in the absence of 
surface factors known to be important in biofilm development. The effects of the CsrA mutation 
are greater than other known global regulators involved in biofim formation, RpoS and OmpR. 
CsrA mainly, though not exclusively, affects biofilm formation through its control of glycogen 
metabolism, Previous studies have identified CsrA as a post- transcriptional repressor of the 
glgCAP operon (Yang, et. al, 1996). Glycogen synthesis and catabolism are essential for 
optimal biofilm development. Furthermore, ectopic expression of glycogen synthase (encoded 
by glgA) and the catabolic enzyme glycogen phosphorylase (encoded by glgP) significantly 
increases biofilm formation. Since a csrA mutant overexpresses the glycogen biosynthetic genes 
(glgCA) as well as the gene for glycogen catabolism (glgP), an increase in glycogen metabolism 
occurs. In a csrA mutant, this increase in glycogen synthesis and its subsequent turn over 
improves biofilm formation (Jackson et. al., 2002). It has been proposed that glycogen may 
serve as a carbon and energy source for the synthesis of adhesions or other factors necessary in 
biofilm production during stationary phase. 

[0008] Since motility and surface attachment abilities in E. coli are important in the initiation of 
biofilm production (Jackson et. al., 2002), many of the mutants generated in biofilm studies 
involve these processes (Pratt, et. al., 1998). To decrease the probability of creating mutations in 
genes already associated with biofilm production, a non-motile (AmotB) attachment deficit 
(AfimB-H) strain of E. coli K12 MG1655 was used as the parent strain in constructing a library 
of mini Tnl0::cam transposon mutants. Such a strain forms little to no biofilm. When E. coli 
MG1655 is rendered non-motile by a motB deletion, biofilm formation is virtually eliminated. A 
mutation in motB, whose product is involved in the flagellum motor switch, renders the bacteria 
paralyzed although flagellum synthesis continues (Pratt, et. al., 1998). A type I pili deletion 
dramatically reduced biofilm formation as well (Jackson et. al., 2002). Deleting JimB-H creates 
a strain unable to produce type I pili (Pratt, et. al., 1998). To facilitate the isolation of biofilm 
down mutants, a csrA mutation was introduced into this strain. In a csrA mutant, these deletions 
decreased biofilm formation as compared to the wild type, however it was not eliminated 
(Jackson et. al., 2002). The csrA mutation would allow for the identification of genes whose 
interruption altered the biofilm phenotype even in the absence of motility and type I pili. The 
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identification of those interrupted genes would result in the description of novel genes involved 
in the process of biofilm formation. 

BREIF DESCRIPTION OF FIGURES 

[0009] Figure 1 is a schematic diagram of a primed polymerase chain reaction. 

[00010] Figure 2 is a schematic diagram of genes or operons for complementation studies. 

[00011] Figure 3 is a schematic diagram of vector pCR2.1 with insertion cloning site and 
resistance gene for Ampicillin. 

[00012] Figure 4A and 4B are charts showing cell growth (total turbidity at A630 before 
staining) and biofilm formation A630 after crystal violet staining) absorbance readings for parent 
strains and representative mutant strains. 

[00013] Figure 5 lists sequences for mutants of the invention. 
SUMMARY OF THE INVENTION 

[00014] Transposon mutagenesis using a mini TnT O:\camR resistance marker yielded 800 
random insertion mutants displaying altered biofilm phenotypes as compared to a non-motile, 
attachment deficient csrA mutant strain. Co-transduction with the resistance marker confirmed 
the linkages of 120 biofilm mutants (see the Sequences of Down Mutants and Up Mutants). 
Amplification of the insertion sites nucleotide sequencing and BLAST searches against E. coli 
K-12 genomic databases, successfully identified 118 of these sites. Many of the interrupted 
genes were not presently known to be associated with biofilm formation. A plasmid clone of the 
nhaAR operon complemented the corresponding mutation and increased biofilm formation by 
-10-fold. Transduction of four of the other interrupted genes into E. coli K-12 MG1655 
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produced altered biofilm phenotypes. Sixty-one different genes identified in this library of 
mutants influence biofilm formation. 

[00015] Although the complete genome of E. coli K-12 was reported by Blattner (Science 
277: 1453(1997) the report failed to suggest the involvement of these genes in biofilm formation. 
The invention also relates to the potential use of these genes or their products for modulating 
biofilm formation. In particular, the invention relates to use of these genes or their products as 
targets for developing antimicrobials, including antibiofilm products. 

[00016] From 120 mutants of Escherichia coli K-12 strain produced by transposon 
mutagenesis, the involvement of sixty-one genes in Escherichia coli biofilm formation has been 
established. 

[00017] Genes involved in the biosynthesis of flagella (JlgB.flgE, flgH t flgl flgK, flhA, 
flhB, flhD, fliA, fliD, fliF, fliG, flil; fliL, fliM, flip, fliR, dnaK and mdoG), genes involved in 
lipopolysaccharide biosynthesis (rfaQ, rfaG, rfaP and wecB\ gene affecting swarming behavior 
{yojN), genes involved in cellular respiration (arcB and frdA) and genes which have no known 
functions (bl936, yhjH t tolA, rnhB, rep and mreB) cause 2 to 12-fold increase in biofilm 
formation when interrupted. 

[00018] Genes involved in the synthesis or degradation of proteins (aroD, clpX, leuO, 
leuL, miaA, prfQ ptrB, rnpB) genes involved in carbon metabolism (fucA and rbsK), gene 
involved in nitrogen metabolism (glnE), gene involved in colanic acid synthesis (weal), genes 
involved in molybdopterin synthesis (modA, modC and moaC), gene involved in adaptation to 
high sodium concentration (nhaA), gene involved in cellular respiration (yecK), genes involved 
in the biosynthesis of polysaccharide adhesin (ycdS, ycdR, ycdQ andycdP), gene associated with 
ferritin encoding gene fin (bl904) and genes which have no known functions (b2531, yedK, 
yjcC, yjjQ, ykgK, hscA and trs5_7) cause 2 to 24-fold decrease in biofilm formation when 
interrupted 
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[00019] The genes of the invention and their corresponding polypeptides 
(proteins/enzymes) thus provide targets for the development of antibacterial and antibiofilm 
compounds, including antibacterial or antibiofilm compounds either inhibit or bind to the 
potential antibacterial or antibiofilm targets (products of genes). These compounds may be 
synthetic or natural. Thus, the present invention provides methods for identifying compounds 
which modulate biofilm formation, comprising identifying compounds which modulate the 
activity of a polynucleotide or polypeptide of the invention. The present invention also provides 
methods for identifying environmental factors which modulate biofilm formation, comprising 
identifying environmental factors which modulate the activity of a polynucleotide or polypeptide 
of the invention. 

DETAILED DESCRIPTION OF THE INVENTION 

[00020] Biofilm formation is a complex developmental process (Danese, et. al., 2001). 
Bacteria have the ability to monitor the environment and make use of the information obtained. 
Environmental and surface conditions determine if biofilm development takes place: If 
conditions are favorable, changes in gene expression occur soon after surface interaction. It is 
known that bacteria in biofilms are phenotypic variations of their planktonic counterparts 
(Lamont, et. al., 1999). This biofilm-induced change is systematic and ongoing during the entire 
course of biofilm formation (Sauer, et. al., 2002). This study focuses on the molecular 
mechanisms that take place within the first 24 hours of biofilm formation. It has been shown that 
motility and surface attachment abilities are important in the initial stages of biofilm production 
by Escherichia coli (Jackson, et. al., 2002). Previous biofilm studies involving E. coli have 
isolated mutations that affect these processes (Pratt, et. al., 2002). Choosing a non-motile strain 
lacking type I fimbriae allowed this study to focus on the genes not previously known to be 
associated with biofilm formation. 

[00021] Over half of the mutations that produced an increased amount of biofilm were 
located in genes involved with flagellar function or synthesis. Many of the mutations could 
result in relieving steric hindrance by reducing the number of flagella present. A reduction in 
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flagellum synthesis occurs during biofilm maturation, indicating their motility may antagonize 
the process (Prigent-Combaret, et. al., 1999; Watnick, et. al., 2000). However, this would not 
explain the effect of the fliD mutation, which eliminates filament cap proteins. This protein is 
responsible for terminating the transport and polymerization of flagellin during flagellum 
growth. Disruption of dnaK and mdoG, both linked to flagellum synthesis (Shi, et. al., 1992; 
Kennedy, 1992), increased biofilm production as well. Although these mutations are found in a 
diverse group of flagellum genes, perhaps they reflect a common cause. It is known that cells 
sense a surface by inhibition of flagellar rotation expression (Davey, et. al., 2000; Alavi, et. al., 
2001). The prevention of rotation has been linked to signal transduction and changes in cell gene 
expression (Davey, et. al., 2000). The background motB deletion in the transposon mutants 
render the flagella paralyzed (Pratt, et. al., 1998). Perhaps this motB mutation effects a signaling 
pathway that normally detects surface interaction through inhibition of flagellum movement. 
Mutations in the flagellum genes may have relieved this suppression. 

[00022] A gene involved in the phosphorelay-signaling pathway (Takeda et. al.; 2001) 
was interrupted in a biofilm up mutant. This gene, yojN, is part of a signaling pathway involved 
in modulating swarming behavior and colanic acid synthesis. It has been demonstrated that this 
system regulates gene expression during colonization on a surface. Swarmer cell differentiation 
in Proteus mirabilis has been linked to changes in flagellar rotation at a surface or in viscous 
medium (Belas, et. al., 1998). Perhaps the phosphorelay signaling pathway is affected by the 
background motB deletion in the same manner as the flagellum genes. 

[00023] Genes involved in lipopolysaccharide (LPS) core biosynthesis (rfaG, rfaP, and 
rfaQ) and the surface O-polysaccharide (wecB\ Barua, et., al., 2002), were interrupted in 10 
biofilm-up mutants. Studies have demonstrated LPS inhibits biofilm formation (Mireles, et. al., 
2001). The length of the O antigenic polysaccharide of LPS inversely correlates with bacterial 
adhesion in several strains of gram- negative bacteria (Landini, et. al., 2002). Inactivation of the 
rfa operon leads to a deep rough phenotype, which can enhance bacterial adhesion to surfaces in 
some conditions. 
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[00024] It has been demonstrated that mutations in surface and membrane proteins lead to 
defects in surface attachment in several gram-negative bacteria (Sauer, et. al., 2001). The outer 
membrane of gram- negative bacteria has an important role in promoting responses to 
environmental change (Otto, et. al, 2001). In nonfimbriated E. coli, inhibition of protein 
synthesis and turnover resulted in a decrease of outer membrane proteins and surface attachment. 
Many of the mutations identified in this study have a role, directly or indirectly, in protein 
synthesis and degradation. aroD (involved in amino acid biosynthesis), clpX (encoding a serine 
protease/chaperone), leuO and leuL (involved in leucine biosynthesis), miaA (involved in tRNA 
modification), prfC (encoding peptide chain release factor 3), and in particular ptrB (encoding a 
protease) and rnpB (encoding RNase P) led to decreases in biofilm development when 
interrupted. 

[00025] Protein analysis has demonstrated changes in protein levels as early as 8 hours 
after attachment (Sauer, et. al, 2002). An increase in proteins involved in metabolic pathways 
such as cofactor biosynthesis, carbon catabolism and amino acid metabolism has been shown. 
Genes involved in carbon catabolism, fucA (encoding for L-fuculose-1 -phosphate aldolase) and 
rbsK (encoding for ribokinase), were interrupted in 2 biofilm down mutants. A gene involved in 
the regulation of nitrogen consumption (glnE), when interrupted, caused a 2-fold decrease in 
biofilm production, weal created a decrease in biofilm by 24-fold when disrupted. The product of 
weal is an unclassified glycosyl transferase known to be involved in colonic acid synthesis. 
Glycosyl transferases are involved in a variety of metabolic pathways. Genes involved in 
molybdopterin biosynthesis (modA, modC and moaC) were identified in three biofilm-down 
mutants. The genes of the moa operon as well as modA and modB are required for the first step in 
molybdopterin synthesis (Anderson, et. al., 2000). Molybdoenzymes are important in many 
bacterial metabolic pathways. 

[00026] The environment within a biofilm is diverse and unlike the environment 
encountered by the free-living bacteria. Bacteria approaching a surface can encounter an 
increase in organic and inorganic ion concentrations, lower pH levels and changes in osmolality 
(Prigent-Combaret, et. al., 1999). Surface attached E. coli have higher intracellular K + levels 
than their planktonic counterparts after 10 hours of incubation, indicating attached bacteria 
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encounter higher-osmolarity conditions. Studies have shown that membrane proteins principally 
involved in molecular transport and adaptation increase following adhesion (Sauer, et. al., 2002). 
nhaR, a gene involved in regulating nhaA, which encodes a Na^/H 4 " antiporter (Dover, et. al., 
2001), and osmC, encoding for an envelope protein of unknown function (Toesca, et. al., 2001), 
caused a decrease in biofilm when interrupted. nhaR positively regulates nhaA in a Na + 
dependent manner. nhaA is necessary for adaptation to high sodium concentrations and alkaline 
pH (Dover, et. al., 2002). nhaR is needed for the induction of osmC during increased osmolarity 
(Toesca, et. al., 2001). 

[00027] As the biofilm develops, microenvironments form (Watnick, et. al., 2000). 
Bacteria associated with single-species biofilms must alter gene expression to survive in their 
microenvironment. Microelectrode measurements show oxygen and pH levels fall, proximal to 
the surface. Three genes identified in this study are involved in cellular respiration. arcB is part 
of a two-component signal transduction system involved in regulation of over 30 operons 
(Kwon, et. al., 2000). This system allows E. coli to sense various respiratory conditions 
(Matsushika, et. al., 1998). Interruption of arcB lead to increased biofilm production. 
Disruption of frdA, involved in fumerate reductase production, lead to an increase in biofilm. 
yecK gene, also called tor Y f encodes for a cytochrome-C type protein (Gon, et. al., 2000). When 
interrupted, yecK created a decrease of approximately 20-fold. This gene is part of a recently 
described respiratory system. Preliminary characterizations of this system indicate it is closely 
related to the Tor and Dor respiratory systems of E. coli (Gon, et. al., 2000). 

[00028] Intercellular adhesion occurs in the formation of microcolonies and is necessary 
for mature biofilm development (Cramton, et. al., 1999). Twenty-two biofilm-down mutants had 
interrupted genes located within the ycd operon (ycdP, ycdQ, ycdR and ycdS). The resulting 
phenotypic changes ranged from 7- to 20-fold. Protein products from this operon currently have 
undefined functions. Two of the genes (ycdQ and ycdR) have protein products homologous to 
IcaA and IcaB in Staphylococcus aureus. This locus has been shown to mediate polysaccharide 
intercellular adhesin (PIA) synthesis (Cramton, et. al, 1999). The intercellular adhesin locus 
(ica) is required for biofilm formation in & aureus and S. epidermidis. 
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[00029] One gene whose disruption caused a 24-fold decrease in biofilm was bl904. 
Currently this gene's function is unknown, however it is directly upstream from the ferritin 
encoding gene ftn. Possibly, interruption of this gene created a polar effect with fin. Previous 
research has indicated that iron may be an important factor in the initiation of biofilm formation 
in Pseudomonas aerugionsa (O Toole, et. al., 1998). 

[00030] Several of the genes identified in this study have no known function or 
homologue of known function. Disruption of b2531, yedK, yjcC, yjjQ and ykgK resulted in a 
decrease in biofilm production, while interruption of bl936 and yhjH created an increase. The 
effect of disruption of yjjQ was particularly notable. The observations in this study might 
facilitate future studies that characterize these genes by giving insight into the function their 
products have. A Current literature review yielded no beneficial information regarding the 
possible correlation between the phenotypic changes seen concerning biofilm formation and the 
interruption of trs5J7, hscA, mreB, rep, rnhB and tolA. Therefore, no details are known of these 
genes' involvement with this process. 

[00031] Of the sixty-one different genes identified, only those involved in motility are 
known to be associated with biofilm production. Interestingly, interruption of these genes 
created a change in the biofilm phenotype contrary to what has been seen in published research 
where disruption of motility decreased biofilm formation. Transduction of four gene mutations 
(nhaR, leuO, yjjQ and yojN) identified in this study into wild type E. coli K12 MG1655 created 
changes in biofilm formation. However, the mutations produced less of an impact in the wild 
type than with TRMG F/M. Complementation of the nhaR interruption using the plasmid 
construct pCRnhaR confirmed that this gene has an important role in biofilm formation. 

Screening Assays using the Identified Genes, Gene Products. 

[00032] The genes and gene products identified in this study represent immediate targets 
for screens that would identify additional influences on biofilm development. A variety of 
existing procedures can be used to develop such procedures. For example, reporter constructs 
can be prepared for any gene of interest, which will allow the identification and measurement of 
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genetic or environmental influences on the expression of that gene. A widely employed example 
of this is the use of 'lacZ transcriptional or translational fusions (example: Romeo et al., 1990; 
1993). First, a fusion gene is created in which the regulatory elements that are located within the 
5 '-region of the gene are fused to the coding region of lacZ. This construct will express beta- 
galactosidase under the regulatory control of the gene of interest. Next, one can create mutants, 
either by directed or random procedures that alter the expression of the gene of interest. 
Screening with X-gal or on McConkey agar will allow mutations that influence the expression of 
the gene of interest. 

[00033] The same kind of gene fusion permits environmental influences on gene 
expression to be assessed, including growth conditions and the presence of inhibitory agents or 
chemicophysical stresses. In the case of genes that are essential for biofilm formation, agents 
that inhibit biofilm formation by blocking the expression may be identified. An advantage of 
this kind of compound would be that it should not kill cells, but only alter their ability to form 
biofilm. Thus, resistances that are typical for drugs that kill or inhibit growth may not develop. 
Of course, many different gene expression reporters can be similarly employed, including 
antibiotic markers, such as chloramphenicol acetyl-transferase (cam) or agents that allow 
immediate visualization of effects, such as green fluorescent protein (gfp). Factors that influence 
the production of proteins that are encoded by biofilm-related genes can also be assessed, e.g. by 
immunological (Western) blotting (e.g. Liu et al., 1997). 

[00034] Thus, the present teachings provide methods for screening compounds to identify 
those which enhance (agonist) or block (antagonist) the action of subject polypeptides or 
polynucleotides, and thus the formation of biofilms. Drug screening assays are made effective 
by use of the control regions of the genes described in the present invention or part of it, in a host 
based DNA-protein interaction assay. 

[00035] For example, to screen for agonists or antagonists, a synthetic reaction mix, a 
cellular compartment, such as a membrane, cell envelope or cell wall, or a preparation of any 
thereof, may be prepared from a cell that expresses a molecule that binds a subject 
polynucleotide. The preparation is incubated with labeled polynucleotide in the absence or the 
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presence of a candidate molecule which may be an agonist or antagonist. The ability of the 
candidate molecule to bind the binding molecule is reflected in decreased binding of the labeled 
ligand. 

[00036] Effects of potential agonists and antagonists may by measured, for instance, by 
determining activity of a reporter system following interaction of the candidate molecule with a 
cell or appropriate cell preparation, and comparing the effect to a baseline (control) 
measurement. 

[00037] Reporter systems that may be useful in this regard include, but are not limited to, 
colorimetric labeled substrate converted into product, a reporter gene that is responsive to 
changes in elongase enzyme activity, and binding assays known in the art. 

[00038] Another example of an assay for antagonists is a competitive assay that combines 
a subject polypeptide and a potential antagonist with membrane-bound subject polypeptide- 
binding molecules, recombinant subject polypeptide binding molecules, natural substrates or 
ligands, or substrate or ligand mimetics, under appropriate conditions for a competitive inhibition 
assay. A subject polypeptide can be labeled, such as by radioactivity or a colorimetric 
compound, such that the number of subject polypeptide molecules bound to a binding molecule 
or converted to product can be determined accurately to assess the effectiveness of the potential 
antagonist. 

[00039] Potential antagonists include small organic molecules, peptides, polypeptides and 
antibodies that bind to a polynucleotide or polypeptide of the invention and thereby inhibit or 
extinguish its activity. Potential antagonists also may be small organic molecules, peptides, 
polypeptides, such as closely related proteins or antibodies that bind the same sites on a binding 
molecule, without inducing subject polypeptide-induced activities, thereby preventing the action 
of the subject polypeptide by excluding the subject polypeptide from binding. Potential 
antagonists include antisense molecules (Okano et al., 1988, EMBO J., 7: 3407-3412). Potential 
antagonists include compounds related to and derivatives of the subject polypeptides. Potential 
antagonists may be selected from the group consisting of small organic molecules, peptides, 
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polypeptides, antisense molecules, oligonucleotides, polynucleotides, fatty acids, and chemical 
and functional derivatives thereof. Selective modulators may include, for example, antibodies 
and other proteins or peptides which specifically bind to the subject polypeptide or 
polynucleotide, oligonucleotides which specifically bind to the subject polypeptide (see Patent 
Cooperation Treaty International Publication No. WO93/05182 which describes methods for 
selecting oligonucleotides which selectively bind to target biomolecules) or the subject 
polynucleotide (e. g., antisense oligonucleotides) and other non-peptide natural or synthetic 
compounds which specifically bind to the subject polynucleotide or polypeptide. Mutant forms 
of the subject polynucleotide which alter the activity of the subject polypeptide or its localization 
in a cell are also contemplated. Crystallization of recombinant subject polypeptides alone and 
bound to a modulator, analysis of atomic structure by X-ray crystallography, and computer 
modeling of those structures are methods useful for designing and optimizing non-peptide 
selective modulators. See, for example, Erickson et al., 1992, Ann. Rep. Med.Chez., 27: 271-289 
for a general review of structure-based drug design. 

DEFINITIONS 

[00040] To facilitate a complete understanding of the invention, the terms defined below 
have the following meaning: 

[00041] Biofilm refers to a community of microorganisms growing on a surface (inert or 
living) enclosed in an exopolysaccharide matrix. 

[00042] Compositions include genes, proteins, polynucleotides, peptides, compounds, 
drugs, and pharmacological agents. 

[00043] Disorder as used herein refers to derangement or abnormality of structure or 

function. Disorder includes disease. 

[00044] Gene refers to a nucleic acid molecule or a portion thereof, the sequence of 

which includes information required for the production of a particular protein or polypeptide 
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chain. The polypeptide can be encoded by a full-length sequence or any portion of the coding 
sequence, so long as the functional activity of the protein is retained. A gene may comprise 
regions preceding and following the coding region as well as intervening sequences (introns) 
between individual coding sequences (exons). A "heterologous" region of a nucleic acid 
construct (i.e. a heterologous gene) is an identifiable segment of DNA within a larger nucleic acid 
construct that is not found in association with the other genetic components of the construct in 
nature. Thus, when the heterologous gene encodes a biofilm gene, the gene will usually be 
flanked by a promoter that does not flank the structural genomic DNA in the genome of the 
source organism. 

[00045] Host system may comprise a cell, tissue, organ, organism or any part thereof, 

which provides an environment or conditions that allow for, or enable, transcription and/or 
transcription. 

[00046] Identity, similarity, homology or homologous, refer to relationships between 

two or more polynucleotide sequences, as determined by comparing the sequences. In the art, 
identity also means the degree of sequence relatedness between polynucleotide sequences, as the 
case may be, as determined by the match between strings of such sequences. Both identity and 
similarity can be readily calculated (Lesk A.M., ed., 1988, Computational Molecular Biology, 
Oxford University Press, NY; Smith D.W., ed., 1993, Biocomputing: Informatics and Genome 
Project, Academic Press, NY; Griffin A.M. and Griffin H.G., eds., 1994, Computer Analysis of 
Sequence Data, Part 1, Humana Press, NJ; von Heijne G., 1987, Sequence Analysis in Molecular 
Biology, Academic Press, NY and Gribskov M. and Devereux J., eds., 1991, Sequence Analysis 
Primer, M Stockton Press, NY). While there exist a number of methods to measure identity and 
similarity between two polynucleotide sequences, both terms are well known to skilled artisans 
(von Heijne G., 1987, Sequence Analysis in Molecular Biology, Academic Press, NY; Gribskov 
M. and Devereux J., eds., 1991, Sequence Analysis Primer, M Stockton Press, NY and Carillo H. 
and Lipman D., 1988, SIAM J. Applied Math., 48: 1073). Methods commonly employed to 
determine identity or similarity between sequences include, but are not limited to those disclosed 
in Carillo H. and Lipman D., 1988, SIAM J. Applied Math., 48: 1073. Methods to determine 
identity and similarity are codified in computer programs. Computer program methods to 



15 



determine identity and similarity between two sequences include, but are not limited to, GCG 
program package (Devereux et al., 1984, Nucl Acid Res., 12: 387-395), BLASTP, BLASTN and 
FASTA (Altschul et al, 1990, 1 Molec. Biol, 215: 403-410). 

[00047] Isolated means altered "by the .hand of man' 1 from its natural state; i.e., that, if 

it occurs in nature, it has been changed or removed from its original environment, or both. For 
example, a naturally occurring polynucleotide naturally present in a living organism in its natural 
state is not "isolated," but the same polynucleotide separated from coexisting materials of its 
natural state is "isolated", as the term is employed herein. As part of or following isolation, such 
polynucleotides can be joined to other polynucleotides, such as DNA, for mutagenesis, to form 
fusion proteins, and for propagation or expression in a host, for instance. The isolated 
polynucleotides, alone or joined to other polynucleotides such as vectors, can be introduced into 
host cells, in culture or in whole organisms. Introduced into host cells in culture or in whole 
organisms, such DNA still would be isolated, as the term is used herein, because they would not 
be in their naturally occurring form or environment. Similarly, the polynucleotides may occur in a 
composition, such as a media formulations, solutions for introduction of polynucleotides, for 
example, into cells, compositions or solutions for chemical or enzymatic reactions, for instance, 
which are not naturally occurring compositions, and, therein remain isolated polynucleotides 
within the meaning of that term as it is employed herein. 

[00048] Mutation. A "mutation" is any detectable change in the genetic material. A 

mutation can be any (or a combination of) detectable, unnatural change affecting the chemical or 
physical constitution, mutability, replication, phenotypic function, or recombination of one or 
more deoxyribonucleotides; nucleotides can be added, deleted, substituted for, inverted, or 
transposed to new positions with and without inversion. Mutations can occur spontaneously and 
can be induced experimentally by application of mutagens or by site-directed mutagenesis. A 
mutant polypeptide can result from a mutant nucleic acid molecule. 

[00049] Nucleic acid construct refers to any genetic element, including, but not 

.limited to, plasmids and vectors, that incorporate polynucleotide sequences. For example, a 
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nucleic acid construct may be a vector comprising a promoter or control region that is operably 
linked to a heterologous gene. 

[00050] Operably linked as used herein indicates the association of a promoter or 

control region of a nucleic acid construct with a heterologous gene such that the presence or 
modulation of the promoter or control region influences the transcription of the heterologous 
gene, including genes for reporter sequences. Operably linked sequences may also include two 
segments that are transcribed onto the same RNA transcript. Thus, two sequences, such as a 
promoter and a "reporter sequence" are operably linked if transcription commencing in the 
promoter produces an RNA transcript of the reporter sequence. 

[00051] Plasmids. Starting plasmids disclosed herein are either commercially available, 
publicly available, or can be constructed from available plasmids by routine application of well 
known, published procedures. Many plasmids and other cloning and expression vectors that can 
be used in accordance with the present invention are well known and readily available to those of 
skill in the art. Moreover, those of skill readily may construct any number of other plasmids 
suitable for use in the invention. 

[00052] Polynucleotides(s) of the present invention may be in the form of RNA, such as 
mRNA, or in the form of DNA, including, for instance, cDNA and genomic DNA obtained by 
cloning or produced by chemical synthetic techniques or by a combination thereof. The DNA 
may be double-stranded or single-stranded. Single-stranded polynucleotides may be the coding 
strand, also known as the sense strand, or it may be the non-coding strand, also referred to as the 
anti-sense strand. Polynucleotides generally refers to any polyribonucleotide or 
polydeoxribonucleotide, which may be unmodified RNA or DNA or modified RNA or DNA. 
Thus, for instance, polynucleotides as used herein refers to, among others, single-and double- 
stranded DNA, DNA that is a mixture of single- and double-stranded regions or single-, double- 
and triple-stranded regions, single- and double-stranded RNA, and RNA that is mixture of single- 
and double-stranded regions, hybrid molecules comprising DNA and RNA that may be single- 
stranded or, more typically, double-stranded, or triple-stranded, or a mixture of single- and 
double-stranded regions. In addition, polynucleotide as used herein refers to triple-stranded 
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regions comprising RNA or DNA or both RNA and DNA. The strands in such regions may be 
from the same molecule or from different molecules. The regions may include all of one or more 
of the molecules, but more typically involve only a region of some of the molecules. One of the 
molecules of a triple-helical region often is an oligonucleotide. As used herein, the term 
polynucleotide also includes DNA or DNA that contain one or more modified bases. Thus, DNA 
or DNA with backbones modified for stability or for other reasons are "polynucleotides" as that 
term is intended herein. Moreover, DNA or DNA comprising unusual bases, such as inosine, or 
modified bases, such as tritylated bases, to name just two examples, are polynucleotides as the 
term is used herein. It will be appreciated that a great variety of modifications have been made to 
DNA and RNA that serve many useful purposes known to those of skill in the art. The term 
polynucleotide as it is employed herein embraces such chemically, enzymatically or 
metabolically modified forms ^of polynucleotides, as well as the chemical forms of DNA and 
RNA characteristic of viruses and cells, including simple and complex cells, inter alia. 
Polynucleotides embraces short polynucleotides often referred to as oligonucleotide(s). It will 
also be appreciated that RNA made by transcription of this doubled stranded nucleotide sequence, 
and an antisense strand of a nucleic acid molecule of the invention or an oligonucleotide fragment 
of the nucleic acid molecule, are contemplated within the scope of the invention. An antisense 
sequence is constructed by inverting the sequence of a nucleic acid molecule of the invention, 
relative to its normal presentation for transcription. Preferably, an antisense sequence is 
constructed by inverting a region preceding the initiation codon or an unconserved region. The 
antisense sequences may be constructed using chemical synthesis and enzymatic ligation 
reactions using procedures known in the art. 

[00053] Promoter refers to a nucleic acid sequence comprising a DNA regulatory 

element capable of binding RNA polymerase directly or indirectly to initiate transcription of a 
downstream (3 ? direction) gene. In accordance with the present invention, a promoter of a nucleic 
acid construct that includes a nucleotide sequence, wherein the nucleotide sequence may be 
linked to a heterologous gene such that the induction of the promoter influences the transcription 
of the heterologous gene. 

[00054] Pu rifled. A "purified" protein or nucleic acid is a protein or nucleic acid 
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preparation that is generally free of contaminants, whether produced recombinantly, chemically 
synthesized or purified from a natural source. 

[00055] Recombinant refers to recombined or new combinations of nucleic acid 

sequences, genes, or fragments thereof which are produced by recombinant DNA techniques and 
are distinct from a naturally occurring nucleic acid sequence 

[00056] Regulatory element refers to a deoxyribonucleotide sequence comprising the 

whole, or a portion of, a nucleic acid sequence to which an activated transcriptional regulatory 
protein, or a complex comprising one or more activated transcriptional regulatory proteins, binds 
so as to transcriptionally modulate the expression of an associated gene or genes, including 
heterologous genes. 

[00057] Reporter gene is a nucleic acid coding sequence whose product is a 

polypeptide or protein that, is not otherwise produced by the host cell or host system, or which is 
produced in minimal or negligible amounts in the host cell or host system, and which is 
detectable by various known methods such that the reporter gene product may be quantitatively 
assayed to analyse the level of transcriptional activity in a host cell or host system. Examples 
include genes for luciferase, chloramphenicol acetyl transferase (CAT), beta-galactosidase, 
secreted placental alkaline phosphatase and other secreted enzymes. 

[0.0058] Silencer refers to a nucleic acid sequence or segment of a DNA control region 

such that the presence of the silencer sequence in the region of a target gene suppresses the 
transcription of the target gene at the promoter through its actions as a discrete DNA segment or 
through the actions of trans-acting factors that bind to these genetic elements and consequently 
effect a negative control on the expression of a target gene. 

[00059] Stringent hybridization conditions are those which are stringent enough to 

provide specificity, reduce the number of mismatches and yet are sufficiently flexible to allow 
formation of stable hybrids at an acceptable rate. Such conditions are known to those skilled in 
the art and are described, for example, in Sambrook et al., 1989, Molecular Cloning, 2nd Edition, 
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Cold Spring Harbor Laboratory Press, Cold Spring Harbour, NY or Ausubel et al., 1994-, Current 
Protocols in Molecular Biology, John Wiley & Sons, NY. By way of example only, stringent 
hybridization with short nucleotides may be carried out at 5-10°C below the Tm using high 
concentrations of probe such as 0.01-1.0 pmole/ml. Preferably, the term "stringent conditions 1 ' 
means hybridization will occur only if there is at least 95% and preferably at least 97% identity 
between the sequences. 

[00060] Tag refers to a specific short amino acid sequence, or the oligonucleotide 

sequence that encodes it, wherein said amino acid or nucleic acid sequence may comprise or 
encode, for example, a c-myc epitope and/or a string of six histidine residues recognizable by 
commercially available antibodies. In practice, a tag facilitates the subsequent identification and 
purification of a tagged protein. 

[00061] Tagged protein as used herein refers to a protein comprising a linked tag 

sequence. For example, a tagged protein includes a biofilm polypeptide linked to a c-myc epitope 
and six histidine residues at the carboxyl terminus of the amino acid sequence. 

[00062] Test compounds as used herein encompass small molecules (e.g. small 

organic molecules), pharmacological compounds or agents, peptides, proteins, antibodies or 
antibody fragments, and nucleic acid sequences, including DNA and RNA sequences. 

[00063] Transfection refers to a process whereby exogenous or heterologous DNA (i.e. a 
nucleic acid construct) is introduced into a recipient eukaryotic host cell. Therefore, in eukaryotic 
cells, the acquisition of exogenous DNA into a host cell is referred to as transfection. In 
prokaryotes and eukaryotes (for example, yeast and mammalian cells) introduced DNA may be 
maintained on an episomal element such as a plasmid or integrated into the host genome. With 
respect to eukaryotic cells, a stably transfected cell is one in which the introduced DNA has 
become integrated into a chromosome so that it is inherited by daughter cells through 
chromosome replication. This stability is demonstrated by the ability of the eukaryotic cell to 
establish cell lines or clones comprised of a population of daughter cells containing the 
introduced DNA. 
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[00064] Transformation refers to a process whereby exogenous or heterologous DNA 
(i.e. a nucleic acid construct) is introduced into a recipient prokaryotic host cell. Therefore, in 
prokaryotic cells, the acquisition of exogenous DNA into a host cell is referred to as 
transformation. Transformation in eukaryotes refers to the conversion or transformation of 
eukaryotic cells to a state of unrestrained growth in culture, resembling a tumorigenic condition. 
In prokaryotes and eukaryotes (for example, yeast and mammalian cells) introduced DNA may be 
maintained on an episomal element such as a plasmid or integrated into the host genome. With 
prokaryotic cells, a stably transformed bacterial cell is one in which the introduced DNA has 
become integrated into a chromosome so that it is inherited by daughter cells through 
chromosome replication. This stability is demonstrated by the ability of the prokaryotic cell to 
establish cell lines or clones comprised of a population of daughter cells containing the 
introduced DNA. 

[00065] Transfection/transformation as used herein refers to a process whereby 

exogenous or heterologous DNA (e.g. a nucleic acid construct) has been introduced into a 
eukaryotic or prokaryotic host cell or into a host system. 

[00066] Variant(s) of polynucleotides are polynucleotides that differ in nucleotide 
sequence from another, reference polynucleotide. A "variant" of a protein or nucleic acid is meant 
to refer to a molecule substantially similar in structure and biological activity to either the protein 
or nucleic acid. Thus, provided that two molecules possess a common activity and can substitute 
for each other, they are considered variants as that term is used herein even if the composition or 
secondary, tertiary, or quaternary structure of one of the molecules is not identical to that found in 
the other, or if the amino acid or nucleotide sequence is not identical. Generally, differences are 
limited so that the nucleotide sequences of the reference and the variant are closely similar overall 
and, in many regions, identical. Changes in the nucleotide sequence of the variant may be silent. 
That is, they may not alter the amino acids encoded by the polynucleotide. Where alterations are 
limited to silent changes of this type a variant will encode a polypeptide or polynucleotide with 
the same amino acid sequence as the reference. Changes in the nucleotide sequence of the variant 
may alter the amino acid sequence of a polypeptide encoded by the reference polynucleotide. 
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Such nucleotide changes may result in amino acid substitutions, additions, deletions, fusions and 
truncations in the polypeptide or polynucleotide encoded by the reference sequence. 

[00067] Vector. A plasmid or phage DNA or other DNA sequence into which DNA can be 
inserted to be cloned. The vector can replicate autonomously in a host cell, and can be further 
characterized by one or a small number of endonuclease recognition sites at which such DNA 
sequences can be cut in a determinable fashion and into which DNA can be inserted. The vector 
can further contain a marker suitable for use in the identification of cells transformed with the 
vector. Markers, for example, are tetracycline resistance or ampicillin resistance. The words 
"cloning vehicle" are sometimes used for "vector." 

[00068] The terminology used herein is for the purpose of describing particular 
embodiments only, and is not intended to limit the scope of the present invention which will be 
limited only by the appended claims. Unless defined otherwise, all technical and scientific terms 
used herein have the same meanings as commonly understood by one of ordinary skill in the art 
to which this invention belongs. 

[00069] The present invention is further described and will be better understood by 
referring to the working examples set forth below. These non-limiting examples are to be 
considered illustrative only of the principles of the invention. Since numerous modifications and 
changes will readily occur to those skilled in the art, it is not desired to limit the invention to the 
exact construction and operation shown and described. Accordingly, all suitable modifications 
and equivalents may be used and will fall within the scope of the invention and the appended 
claims. 

EXAMPLES 

[00070] The present invention is further described by the following examples. These 
examples, while illustrating certain specific aspects of the invention, do not portray the 
limitations or circumscribe the scope of the disclosed invention. 
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EXAMPLE 1: Generation of Library of Mutants and Screening for Biofilm Formation 

[00071] Escherichia coli K12, substrain MG1655 csrA::kan AmotB AfimB-H (designated 
as TRMG F/M in this study) was used as the parent strain in generating the library of mini 
InlOwcam transposon mutants. The mutations that resulted in altered biofilm phenotypes as 
compared to the parent were then transduced either back into the parent or the closely related 
strain E. coli MG1655 csrA::kanR (designated in this study as TRMG1655). The lambda phage 
permissive host E. coli LE392 was used in the preparation of the X lysate for the transposon 
mutagenesis. Wild type E. coli K12, substrain MG1655 was used in the transduction and 
complementation studies involving four genes whose interruption resulted in a change in biofilm 
formation as compared to the parent. E. coli TOP 10 electrocompetent cells (Invitrogen, 
Carlsbad, CA) were used in the cloning of selected genes for the complementation studies. These 
competent cells contain no lad repressor gene, therefore, IPTG is not needed for blue/white 
screening of recombinants. The phage. 1NK1324 used in the transposon mutagensis contains 
the miniTn/0::az/n resistance marker, enabling selection of the transposon mutants by resistance 
to the antibiotic chloramphenicol (Kleckner, et. al., 1991). Candidate transposon mutants had 
their mutations confirmed by transduction using the Plv/r phage. This bacteriophage is strictly 
lytic and forms only clear plaques. The plasmid pCR2.1 (Invitrogen, Carlsbad, CA), used to 
clone selected genes for complementation, carries the ampicillin resistance gene as well as lacZa 
gene. It is supplied linearized with single 3 7 end thymidine overhangs and has topoisomerase 
covalently bound to the 3 7 end (see Table 1 for a summary of these mutants). 
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Table 1. Bacterial strains, phages and plasmids 



Strains, phages and plasmids Source 



Characteristics 



E. coli 

K12 strains 
MG1655 
TRMG1655 
TRMG F/M 

TOP10 

Bacteriophages 
XNK1324 
Plv/r 

Plasmids 
pCR2.1 



Michael Cashel 
Laboratory strain 
Laboratory strain 

Invitrogen 



ATCC 
Carol Gross 



Invitrogen 



Wild type 

MG1655 csrA::kanR 
TRMG1655zl motB AfimB-H 

uvrC279::Tn!0 
electrocompetent strain 



miniTnT 0::camR 

Strictly lytic PI, clear plaques 



TA cloning vector, 
Ampicillcin resistant 



Transposon Mutagenesis 

[00072] Protocols followed for transposon mutagenesis are based on the published 
procedures (Kleckner, et. al., 1991). 



Preparation of X NK1324 lysate 

[00073] The permissive host strain E. coli LE392 was grown to saturation in Tryptone Bl 
Broth with maltose and magnesium (TBMM) medium at 37°C overnight. A serial dilution was 
made of the ATCC X NK1324 stock in TMG buffer (1.2 g Tris base, 2.46 g MgS0 4 7H 2 0, 0.1 g 
gelatin, 1 liter distilled water, adjust pH to 7.4). Aliquots (0.1 ml) of each dilution were added to 
0.1 ml of the overnight LE392 culture. The resulting mixture was incubated for 20 minutes in a 
37°C water bath. Tryptone Broth (TB) top agar (3.0 ml), at a temperature of 45°C, was added to 
each tube and mixed well. This mixture was then poured atop a Tryptone Bl (TBI) bottom agar 
plate, allowed to solidify and incubated at 39.5 to 42°C overnight (7-8 hours). Large single 
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plaques were picked and individually placed in microcentrifuge tubes containing 0.2 ml LB broth 
plus 0.0 1M MgS0 4 . The mixtures were vortexed and allowed to stand at room temperature for 
10 minutes. In a small test tube, 0.1 ml of the dissolved plaque mixture was mixed with 0.1 ml 
of an overnight LE392 culture. This was incubated in a 37°C water bath for 20 minutes. Molten 
TB top agar (3 ml at 45 °C) was added and gently vortexed. This mixture was then poured atop a 
TBI bottom agar plate, allowed to solidify and incubated at 39.5 to 42°C overnight (7-8 hours). 
After incubation, the entire top agar surface was removed and placed into a sterile centrifuge 
tube. LB broth (1 ml) plus 0.01M MgSCU was used to rinse the plate. This rinse was added to 
the top agar surface scraping. To this, five drops of chloroform were added. After vortexing, 
the mixture was centrifiiged at 3000 x g (J-21C centrifuge, Beckman coulter, Fullerton, CA) for 
20 minutes at 4°C. The supernatant solution was recovered, and a drop of chloroform was 
added. It was stored at 4°C. 

Titration of X NK1324 lysate 

[00074] Serial dilutions of the phage lysate were prepared using TMG buffer. An 
overnight culture of LE392 (0.1 ml) was added to 0.1 ml of each dilution for final serial phage 
dilutions of 10 _t to 10" 8 . The mixtures were then incubated for 20 minutes in a 37°C water bath. 
Molten TB top agar (3 ml at 45°C) was mixed into the samples. This mixture was poured atop a 
TBI bottom agar plate and allowed to solidify. The plates were incubated at 40°C overnight. 
The plaques were counted and the pfii/ml of the lysate calculated. 

Construction of transposon mutants 

[00075] The parent strain TRMG F/M was subjected to insertion mutagenesis as follows. 
An overnight culture grown in TBMM was diluted 1:100 in TBMM for a total volume of 50 
ml. This was incubated at 37°C until the absorbance at 600 nm reached 1 (Ultraspec III, 
Pharmacia). The cells were then harvested by centrifugation at 7500 x g and 4°C (J-21C 
Beckman centrifuge). Cells were resuspended in 1 ml LB broth with 0.2% maltose. 
Concentrated cells (0.2 ml) were mixed with the ANK1324 lysate for a multiplicity of infection 
equal to 0.2. The mixture was incubated for 30 minutes at room temperature and then 90 
minutes at 37°C. Infected cells (0.1 ml) were then plated on Kornberg agar containing 1% 
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glucose, 2.5 mM sodium pyrophosphate and 25 jig/ml chloramphenicol. The plates were 
incubated overnight at 39°C. Colonies were picked and inoculated into wells on a 96 well 
microtiter tray containing Colony-Forming Antigen (CFA) medium plus 25 (ig/ml 
chloramphenicol. The microtiter trays were incubated at 26 °C without shaking for 24 hours. 
The 24-hour cultures were then used to inoculate the corresponding wells of another microtiter 
tray containing CFA plus 25 |ig/ml chloramphenicol for a final dilution of 1:100. These trays 
were then incubated at 26°C without shaking for 24 hours. The total turbidity and quantity of 
biofilm formed was determined by the assay described below. 

Biofilm Assay 

[00076] A 1:100 dilution of overnight culture and CFA plus 20 (ig/ml chloramphenicol 

was used to inoculate wells of a 96-well microtiter tray in triplicate. The trays were incubated 
for 24 hours at 26°C without shaking. Total turbidity of the wells was determined by measuring 
the absorbance at 630 nm. The amount of biofilm produced was determined by measuring the 
absorbance of crystal violet stained attached cells (Jackson et. al., 2002). This was accomplished 
by discarding the medium, rinsing the well three times with deionized water and staining the 
attached cells within the wells for one minute with crystal violet (.41%, W/V). The crystal violet 
was discarded and the wells rinsed again three times with deionized water. After allowing the 
wells to air dry, the dye was solubilized with 100 |il of 33% acetic acid. The absorbance of 
resulting solution was measured at 630 nm using a Dynatech MXR microtiter plate reader 
(Dynatech, Chantilly, Va.)(see Figure 4 A & B for results). 

Transduction 

[00077] Transduction was performed using Plv/r lysate (described above) from 

transposon mutants that displayed altered biofilm production as compared to the parent, TRMG 
F/M (Miller et. al., 1972). Recipient strains for transduction were: TRMG1655 (for mutants that 
displayed a decrease in biofilm) and TRMG F/M (for mutants that displayed an increase in 
biofilm). Recipient strains were grown overnight in LB and 0.2% glucose broth (1 ml) plus 50 
jig/ml kanamycin at 37°C while rotating at 250 rpm. The overnight cells were harvested by 
centrifugation and resuspended in 1 ml MC buffer (0.1 M MgS0 4 , 0.005 M CaCl 2 ). The 



26 



resulting suspension was aerated for 15 minutes at 37°C while rotating at 250 rpm. The reactions 
were incubated in a 37°C water bath for 20 minutes. Sodium citrate buffer (1 M, 0.2 ml) was 
added and the contents mixed well. The cells were then plated to Kornberg agar containing 0.5% 
glucose and chloramphenicol (25 jag/ml). The plates were incubated overnight at 37°C. 
Transductants were re-streaked onto Kornberg agar containing 0.5% glucose and 
chloramphenicol (25 |ig/ml) and incubated overnight. Colonies were tested for altered biofilm 
production as compared to the parent as previously described. 

EXAMPLE 2: Identification of Transposon Insertion Sites 

Arbitrarily primed polymerase chain reaction 

[00078] Arbitrarily primed polymerase chain reaction (PCR) was performed to amplify 

the chromosomal DNA flanking the transposon insertion sites (Pratt, et. a., 1998). Primers used 
in the first round were OUT1-L (designed to anneal to the one end of the insertion element) and 
ARB-1 (designed to anneal to arbitrary chromosomal sequences flanking the insertion). Primer 
PRIM1-L, used in the second round of the PCR, is designed to anneal to the same end of the 
insertion element as OUT1-L. However, PRIM1-L is specific for the extreme end, near the 
junction with the chromosome. ARB-2, used in the second round as well, is identical in 
sequence to the 5 ; end of the ARB-1 primer (Fig. 1). Alternatively, DNA flanking the insertion 
sites not successfully amplified with the above primers was subjected to PCR using CAM-5IN 
and CAM-50UT primers in conjunction with ARB-1 and ARB-2 primers described above. 
CAM-5IN and CAM-50UT are designed to anneal to the opposite end of the insertion element 
in the same manner as OUT1-L and PRIM1-L. For primer sequences refer to Table 3. 

[00079] The chromosomal template was prepared by inoculating LB broth (1 ml) plus 

0.2% glucose and chloramphenicol (20 jig/ml) with ice scrapings from the desired stock. This 
was incubated overnight at 37°C while rotating at 250 rpm. The overnight culture (200 jal) was 
centrifiiged for 30 seconds at 16,000 X g (MP4R centrifuge, International Equipment Co., 
Needham Heights, Mass.) and the supernatant discarded. The cells were resuspended in 100 \ii 
sterile deionized water. The cell suspension was then boiled for 5 minutes at 95°C and allowed 
to cool on ice for 2 minutes. Following centrifugation as described above, the supernatant 
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containing the chromosomal DNA was used as the template for the amplification. The PCR 
reactions procedures are based on protocols from Invitrogen using Elongase® enzyme mix 
(Invitrogen, Carlsbad, CA) and standard reaction conditions for PCR (Sambrook, et. al., 2001). 
The enzyme mix consists of a mixture of Taq and Pyrococcus species GB-D thermostable DNA 
polymerase. The Elongase® comes packaged with a buffering system that allows optimization 
of the reaction with respect to the amount of [Mg 2 ] used in the mixture. A PTC- 100 
programmable Thermal Cycler (MJ Research, Inc., Waltham, MA) was used to carry out the 
PCR reactions. The first round (25 \x\ total volume) of arbitrarily primed PCR was set-up as 
follows: 0.5 \i\ dNTP (10 mM), 0.25 \i\ OUT1-L primer (50 |iM), 0.25 \i\ ARB1 primer (50 
fiM), 5 \i\ Buffer B (final [Mg 2 ] 2.0 mM), 2.5 \x\ chromosomal DNA template, 16 \x\ sterile 
deionized water, 0.5 jal Elongase® enzyme mix. The solutions were gently mixed and placed 
into the Thermal Cycler. The first round conditions were: (1) 95°C for 30 seconds, (2) 95°C for 
30 seconds, (3) 43°C for 30 seconds, (4) 72°C for 2 minutes, (5) Cycle to step 2 for 30 times, (6) 
4°C for 12 hours. The second round reaction (50 jil total volume) was set up as follows: 1 jil 
dNTP (10 mM), 0.5 \x\ ARB2 primer (50 ^iM), 0.5 \l\ 1-L primer (50 >iM), 10 \x\ buffer B (final 
[Mg 2+ ] 2.0 mM), 5 jil DNA template (first round product), 32 \x\ sterile deionized water, 1 \A 
Elongase® enzyme mix. The solutions were mixed and placed into the Thermal Cycler. The 
second round reaction conditions were: (1) 95°C, for 30 seconds, (2) 95°C, for 30 seconds, (3) 
55°C for 30 seconds, (4) 72°C for 2 minutes, (5) cycle to step 2 for 30 times, (6) 4°C for 12 
hours. The PCR products were stored at -20°C. 



28 



Table 2. Sequence of Primers 

Primer Name Sequence (5 to 3 ) 

APvB-1 GGCCACGCGTCGACTAGTACNNNNNNNNNNGAT 

0UT1-L CAGGCTCTCCCCGTGGAG 

CAM-IN AAAGTGCGGGTGATGC 

ARB-2 GGCCACGCGTCGACTAGTAC 

PRIM1-L CTGCCTCCCAGAGCCTG 

CAM-OUT CTGACGGGGTGGTGCGTAACGGC 

nhaRF CACTCGTGAGCGCTTACAGCCG 

nhaRR GATTCCTCTATTTATTCGCCCGC 

leuOF CTAATCAACGAGGAAAAAGGGAC 

leuOR AATCGGCTGAATCCCACAACTTAC 

yjjQF CTTCTGACACATGCAGTGGAGTTG 

yjjQR GCGATTGTACGCTGAAATGAAAGCAC 

yojNF ATGCCACTGCATACTGATTAACCC 

yojNR CGGTGCAAATGCCAGATAAGACACTAAC 

M13UP CAGGAAACAGCTATGAC 

nhaRl AAACGTGAACTGATGCAAGGATCGC 

nhaR2 GATCATCGCTGGCTTGCTGATTGGC 

nhaR3 GATATTGTGAACTATCGCAAAGAATCC 

nhaR4 CGATGCACAATGCAATCTTCGTTGCC 
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Gel purification of PCR products 

[00080] PCR products were separated on 1% agarose gel using 0.5X TBE electrophoresis 

buffer (0.045 M Tris-borate, 0.001 M EDTA). Ethidium bromide was added to aid in 
visualization of the amplification product with UV light. The products were purified by gel 
extraction using QIAquick® Gel Extraction Kit following the manufacturers recommended 
protocol (QIAGEN, Valencia, CA). The concentration of the purified DNA was determined by 
the absorbance of a 1:10 dilution of the sample at 260 nm (GeneQuant III, Pharmacia Biotech, 
Piscataway, NJ). Absorbance readings at 230 nm, 280 nm, and 320 nm were also determined to 
check for protein and organic contaminants. Samples were stored at -20°C. . 

Sequencing of Insertion sites 

[00081] After determining the correct concentration of DNA needed (2.5 ng/jil are 

needed per 100 base pairs), the purified amplification products were sent to the University of 
Arizona's DNA sequencing facility. This lab, managed by the Laboratory of Molecular and 
Systematics Evolution (LSME), uses a fluorescence-based DNA sequencing system. 
Dideoxynucleotides, each labeled with a distinct flourochrome, produce chain terminating 
reactions and labeled products. The products are analyzed and sequences determined using an 
ABI PRISM 3700 DNA Analyzer (Applied Biosystems Incorporated, Foster City, Ca). The 
sequences are made available on their Internet web site ( h ttp : llxxo fadna . arl . ari zona. edu ) and 
downloaded for further analysis. 

Identification of genes interrupted by the transposon 

[00082] Identification of the insertion sites was made by performing BLAST searches. 
Basic Local Alignment Search Tool, or BLAST, is a program sponsored by the NCBI (National 
Center for Biotechnology Information). BLAST is a set of similarity search programs that are 
designed to search databases for matches to a query sequence (Altschul, et. al., 1990). The 
sequence information obtained from the University of Arizona was searched with this program 
against Escherichia coli K12 databases maintained at NCBI. 

Complementation studies 
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[00083] Four genes not previously known to be involved in biofilm formation, and whose 
interruption resulted in altered biofilm development, were chosen for complementation studies. 



Transduction of E. coli MG1655 

[00084] Plv/r lysates from transposon mutants whose interrupted gene was chosen for 

complementation were used to co-transduce (previously described) the chloramphenicol 
resistance marker into E. coli K12 substrain MG1655. Colonies were picked and plated to 
Kornberg agar containing 0.5% glucose and chloramphenicol (25 |xg/ml). After overnight 
growth, isolated colonies were used to inoculate LB broth plus 0.2% glucose (1 ml) with 
chloramphenicol (25 |ig/ml) and incubated overnight at 37°C while rotating at 250 rpm. Biofilm 
assays were performed and stock cultures prepared as previously described. The results of the 
biofilm assay were subjected to analysis of variance testing (ANOVA) to determine if changes 
seen were statistically significant as compared to the wild type. 

Primer design and PCR conditions for selected genes 

[00085] Primers were designed to include the predicted promoter region as well as the 

stop codons for each gene or operon selected (Fig. 2). PCR conditions and reaction mixtures are 
based on protocols from Invitrogen (Invitrogen, Carlsbad, CA), using Elongase® enzyme mix 
and standard reaction conditions for PCR (Sambrook, et. al., 2001). A manual hot start was used 
at the beginning of each PCR protocol to maximize the production of the desired PCR product. 
PCR was carried out using a PTC- 100 programmable Thermal Cycler (MJResearch Inc., 
Waltham, MA). For primers used, refer to Table 2. A description of the PCR conditions for 
each gene or operon follows. 

I. nhaR -The segment is 2.4 kb in length. 

PCR conditions: (1) 94°C, 30 seconds; (2) 94°C, 30 seconds; (3) 55°C, 30 seconds; 
(4) 68°C, 3 minutes; (5) cycle to step 2, thirty times; (6) 72°C, 10 minutes; (7) 4°C, 
12 hours. 

II. leuO - The segment is 1 .6 kb in length. 

PCR conditions: (1) 94°C, 30 seconds; (2) 94°C, 30 seconds; (3) 55°C, 30 Seconds; 
(4) 68°C, 2 minutes; (5) cycle to step 2, thirty times; (6) 72°C, 10 minutes; (7) 4°C, 
12 hours. 

' III. yjjQ - The segment is 1 .6 kb in length. 

PCR conditions: (1) 94°C, 30 seconds; (2) 94°C, 30 seconds; (3) 57°C; 30 seconds; 
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(4) 68°C, 2 minutes; (5) cycle to step 2, thirty times; (6) 72°C, 10 minutes; (7) 4°C, 
12 hours. 

IV. yojN- The segment is 4.1 kb in length ion. 

PCR conditions: (1) 94°C, 30 seconds; (2) 94°C, 30 seconds; (3) 55°C, 30 seconds; 
(4) 68°C, 4 minutes; (5) cycle to step 2, forty times; (6) 72°C, 15 minutes; (7) 4°C, 
12 hours. 

[00086] The PCR reactions were carried out in 50 jal total volume amounts using the 

following reagents: 1 |il dNTP (10 mM), 0.5 |il forward primer (50 jiM), 0.5 jil reverse primer 
(50 |iM), 1 \A chromosomal DNA template (490 ng/jil), 10 \i\ buffer B (final [Mg 2 ] 2.0 mM), 36 
j^l sterile deionized water, 1 \i\ Elongase® enzyme mix. 

Purification of long PCR products 

[00087] PCR products for cloning were gel purified as recommended by TOPO PCR 
cloning kit for long PCR products (Invitrogen, Carlsbad, CA). Briefly, PCR products were 
separated on a 0.8% agarose gel made with IX TAE buffer (0.04 M Tris-acetate and 0.001 M 
EDTA), containing 2 mg/ml Crystal Violet solution for visualization of DNA. Eight microliters 
of 6X Crystal Violet Loading Buffer (Invitrogen, Carlsbad, CA) were used to load the PCR 
product onto the gel. After electrophoresis, the PCR product was excised from the gel and 
placed into a sterile microcentrifuge tube, weighted, and 2.5 volumes of 6.6 M sodium iodide 
were added and mixed by vortexing. The sample was then incubated at 50°C until the agarose 
completely melted. Binding buffer (Invitrogen, Carlsbad, CA) was mixed in (1.5 volumes). The 
suspension was loaded onto a S.N.A.P. (Invitrogen, Carlsbad, CA) purification column. DNA 
was bound to the membrane in the column by centrifugation at 2500 x g for 30 seconds (MP4R 
centrifuge, International Equipment Co.). To ensure the entire amount of DNA was harvested 
from the solution, the filtrate was poured back into the column and this step repeated two times. 
The column was washed twice with 400 \l\ of Final Wash (Invitrogen, Carlsbad, CA), containing 
75% ethanol and 400 mM NaCl. After allowing the column membrane to air dry, the DNA was 
eluted by applying 40 jil TE buffer and centrifuging at 16,000 X g for one minute (MP4R 
centrifuge, International Equipment Co.). Absorbance readings of a 1:10 dilution at 260 nm 
were made to determine the concentration of the purified PCR products (GeneQuantll, 
Pharmacia Biotech). The DNA was stored at -20°C. 
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Ligation 

[00088] The purified PCR products were ligated into the pCR2.1 plasmid vector (Fig. 3) 

using the TOPO TA cloning kit as described by the manufacturer (Invitrogen, Carlsbad, CA). 

Preparation of electrocompetent cells 

[00089] Overnight cultures (2.5 ml) were used to inoculate 500 ml of LB broth plus 0.2% 

glucose and 20 jxg/ml chloramphenicol in a 2000 ml flask. This was incubated at 37°C while 
rotating at 250 rpm until an optical density at 600 nm reached approximately 0.8. 

[00090] (Ultraspec III, Pharmacia Biotech). Aliquots of the cells (250 ml) were 

transferred to chilled 250 ml sterile centrifuge bottles (Nalgene, Co.) and pelleted by centrifuging 
at 4°C for 15 minutes at 7520 x g (J-21C Beckman centrifuge). The cells were resuspended in 
250 ml ice cold 10% glycerol and pelleted again for 15 minutes as described above. After 
decanting the supernatant, the cells were resuspended in 125 ml of ice cold 10% glycerol. 
Repeating the above centrifiigation, the cells were pelleted a third time and resuspended in 62.5 
ml ice cold 10% glycerol. The cell suspensions were combined into one 125 ml portion and 
pelleted as previously described. Each cell pellet was suspended in 1.5 ml ice cold 10% 
glycerol. Aliquots (150 (il) of the suspensions were dispensed into sterile 1.5 ml 
microcentrifuge tubes and stored at -80°C. 

Transformation 

[00091] Electrocompetent cells, prepared as described above, or E. coli TOP 

electrocompetent cells (Invitrogen, Carlsbad, CA) were thawed on ice and transformed by 
mixing 2-3 jal of DNA with 50 (il of the thawed cells. The mixture was then transferred to a 
pre-chilled 0.1 cm electrode gap electroporation cuvette (Bio-Rad Laboratories, Hercules, CA) 
and electroporated using a Gene Pulser II Apparatus (Bio-Rad Laboratories). Electroporation 
settings were 1.8 kilovolts, 25 microfaradays, and 200 ohms with a maximum time of 4.5 to 5 
milliseconds. Cells were immediately recovered after electroporation in 1 ml room temperature 
SOC medium [2% Tryptone, 0.5% yeast extract, 10 mM NaCl, 2.5 mM KC1, 10 mM MgCl 2 , 10 
mM MgS0 4 , and 20 mM glucose (Invitrogen, Carlsbad, CA)]. Cell suspensions were transferred 
to a 15 ml culture tube and incubated for at least one hour at 37°C with gentle rotation for 
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aeration. Various volumes (20 50 \x\, 80 100 jil and 150 |il) of the electroporated cell 
suspensions were used to inoculate LB or Kornberg agar plates containing the carbenicillin (100 
jig/ml). X-gal (40 \xl, 40 mg/ml) reagent was added to the media for the blue/white screening of 
the pCR2.1 vector constructs containing the cloned inserts. The plates were incubated overnight 
at 37°C. 

Selection of transformed cells 

[00092] Colonies displaying the correct selection criteria, as indicated below, were 

grown overnight in 1ml LB broth containing 0.2% glucose and 100 (ig/ml carbenicillin. Stocks 
of the overnight cultures were prepared as previously described. Presumptive transformants 
having vector constructs with the cloned inserts were identified by blue/white screening and 
resistance to carbenicillin. Transformed transposon mutants or transductants were determined by 
resistance to carbenicillin. 

Isolation of plasmid constructs from E. coli TOP 10 cells 

[00093] Mini -preparations of plasmid from E. coli TOP 10 cells were obtained using 

Quantum Prep® Plasmid miniprep kit (BioRad Laboratories) as described by the manufacturer. 
Alternatively, mini-preparations were made by centrifuging an overnight culture (1.5 ml) at 
10,000 x g for 1- 2 minutes (MP4R, International Equipment Co.). The cell pellets were 
suspended in 150 \i\ ice cold SET buffer. Freshly made lysis buffer (350 (il; 20 \x\ 10N NaOH, 1 
ml 10% SDS, 8.8 ml sterile deionized water) was added. The suspension was gently mixed and 
placed on ice for 10 minutes. Ice cold KAC (250 |il; 3 M K+, 5 M Acetate) was added, the 
mixture was inverted several times and placed on ice for 10 minutes. After centrifuging the 
tubes for 10 minutes at 16,000 x g and 4°C (MP4R, International Equipment Co.), the 
supernatant solution was transferred to a clean tube. An equal amount of room temperature 
100% isopropanol was added and the tubes centrifiiged for 5 minutes at 16,000 x g. After 
carefully removing the supernatant, the pellet was resuspended in 200 jul TE buffer (10 mM Tris 
CI, 1 mM EDTA pH 8.0). RNaseA (2 |lx1; 20 ng /ml) was added and the mixture was incubated at 
37°C for 30 minutes. Phenolxhloroform (200 jil, 1:1 ratio) was added. After mixing and 
centrifugation (16,000 x g for 10 minutes at 4°C), the upper phase was collected and transferred 
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to a clean tube. An equal volume of chloroform was added and the mixture centrifiiged as 
previously described. The upper phase was transferred to a clean tube and 4 M NaCl was added 
to yield a final concentration of 0.2 M. To this, 2 volumes of 100% ethanol were added and the 
sample was incubated 15 minutes at -80°C. The DNA was sedimented by centrifuging the 
sample at 16,000 x g for 10 minutes at 4°C. After decanting the supernatant, the DNA was 
washed with 100% prechilled ethanol The sediment was air dried and resuspended in sterile 
deionized water (20(il) and stored at -20°C. Larger scale plasmid preparations were made using 
QIAGEN® Midi-plasmid Purification kit according to the manufacturer's protocol (QIAGEN 
Inc., Valencia, CA). The concentration of plasmid DNA was obtained by reading the absorbance 
of a 1:10 dilution of the preparation at 260 nm (GeneQuant II, Pharmacia Biotech). 

Restriction enzyme digestion of plasmid constructs 

[00094] The recombinant plasmids were subjected to restriction enzyme digestion as a 

means of screening for the presence of the desired construct. Digestions were performed by 
mixing 2 jal of enzyme specific buffer, 1 jil digestion enzyme (Promega, Madison, WI), 1 )il to 4 
(il plasmid DNA preparation and sterile deionized water, up to a total volume of 20 The 
digestions were incubated for 3 hours in a 37°C waterbath. The nhaR construct was digested 
with Hindlll (10 U/|al) and EcoRV (10 U/|il) enzymes. The leuO construct was digested with 
Apa\ (12 \JI\x\ )and EcoRV, The construct containing the yjjQ gene was digested with the 
enzymes AlwAAX (10 U/|il) and EcoRV. The yojN construct was digested with Bgli (10 U/jal) 
and EcoRV. 

Transformation of transposon mutants or transductants 

[00095] Transposon mutants or transductants were transformed using the pCR2.1 

plasmid constructs as previously described. A vector only control (pCR2.1 without an insert) 
was transformed into the chosen strains as a control. Selection of transformants was based on 
antibiotic resistance as previously described. Biofilm assays (previously described) were 
performed and the results compared to that of the parent strain. 

Sequencing of cloned gene 
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[00096] The cloned gene in the recombinant plasmid designated pCRnhaR (which 

successfully complemented the corresponding transposon mutant) was sequenced to ensure the 
inserted gene was free of any PCR-generated mutations. This was accomplished by using a 
primer specific for a sequence upstream of the insertion site (M13UP) as well as several primers 
designed to anneal to sequences within the insert (nhaRl, nhaR2, nhaR3 and nhaR4). Sequences 
of these primers can be found on Table 2. Plasmid preparations were made and DNA 
concentrations determined as described previously. The samples were then sent to the University 
of Arizona for sequencing. The sequence information was compared to the E .coli K12 
databases at the NCBI internet web site for confirmation. 

EXAMPLE 3: Genes Altering Biofilm Formation 

[00097] Transposon mutagenesis produced 800 random insertion mutants. Using 

microtiter assay to screen for altered biofilm formation ability, 120 of these mutants displayed an 
increase or decrease in biofilm production as compared to the parent. BLAST searches against 
Escherichia coli K12 genomic databases revealed 120 matches. Of this total, 61 were from 
mutant strains with a decrease in biofilm formation and 59 were from mutant strains with an 
increase in biofilm production (Tables 3 & 4). Many of the same genes were interrupted in more 
than one mutant. Multiple mutations were isolated from several operons as well. The genes from 
XXizycd operon, for example, were interrupted in 22 mutants that exhibited decreased biofilm. 
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Table 3. Mutant strains with a decrease in biofilm formation compared to parent TRMG. 



Gene/Function 



Strain Mutation location Biofilm Ratio 
Relative to inition codon (Parent:Mutant) 



aroD - amino acid biosynthesis 87C5 



90E7 



bl904 -protein modulating ferritin 
encoding gene fin 



11E4 



16C8 



31G6 

b2531 -protein modulating biofilm 91F9 
formation 



clpX- serine protease chaperone 

fucA - degradation of carbon 

compounds; L-fuculose- 
1 -phosphate aldolase 

glnE - adenylyl transferase; 
activation/deactivation 
of glutamine synthetase 

hscA - heat shock protein; 
Hsp 70 family 

nhaR - regulation of cation 
transport 

leuL - leader peptide; leucine 
biosynthesis 

leuO - regulator, LysR-like; 
leucine biosynthesis 



63E2 
24E12 

68E11 

35C6 

96B10 

109B4 

14B7 

41B10 

41G10 



+259 



+267 



79 



-49 

-78 
+67 

- 10 
+43 

+2176 

+1426 

+432 

-113 

+150 
+150 
+150 



1.5:1 
2:1 
2:1 

24:1 
2:1 
2:1 

2:1 
8:1 

2:1 

6:1 

11:1 

2:1 

4:1 
9:1 
10:1 



37 



Gene/Function 



Strain Mutation location Biofilm Ratio 
Relative to inition codon (Parent:Mutant) 



leuO - continued 



miaA - modification of tRNA; 
aminoacyl tRNA 
synthetases 

rhoaC - enzyme; molybdopterin 
synthesis 

modA - molybdate uptake; peri- 
plasmic molybdate 
binding protein 

modC - molybdate uptake; ATP- 
binding component of 
molybdate transport 

pr/C - protein synthesis; release 
factor - 3 



89E11 
102G9 

106B2 

126C5 

140B5 



75E11 
123F5 

51A10 
56C11 
150G7 



ptrB - protein degradation; protease 32A4 

rbsK - degradation of carbon 1 25 A7 
compounds; ribokinase 

rnpB - RNA degradation; Rnase P 90 A 1 1 

trs5_7 - IS phage; transposase 48 A4 

weal - colanic acid biosynthesis; 68 A3 
glycosyl transferase 



ycdP - polysaccharide adhesin 
synthesis 



150E3 



+150 
+150 

+150 

+150 

+731 

+1335 
+255 

+1136 
+1136 
+955 

-16 

+361 

+171 
+865 
+196 

+146 



8:1 
5:1 

5:1 

7:1 

9:1 

2:1 
4:1 

2:1 
5:1 
4:1 

14:1 
2:1 

3:1 
20:1 
>24:1 



20:1 
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Gene/Function Strain Mutation location Biofilm Ratio 
Relative to inition codon (Parent:Mutant) 

ycdQ -protein product homologous 1G3 +672 16:1 

to IcaA in Staphylococcus 46E5 +204 10:1 
aureus; biosynthesis of 

polysaccharide adhesin 86E7 +471 10:1 

86F11 +471 11:1 

ycdR - protein product homologous 84A10 +1461 11:1 
to IcaB in S. aureus; 

biosynthesis of polysaccharide 93E3 +1461 12:1 
adhesin 

115C12 +1216 16:1 

155F4 +510 20:1 

169G4 +868 18:1 

ycdS -outer membrane protein 12F12 +672 23:1 

26G11 +2064 8:1 

31 A4 +1124 7:1 



44C2 +1124 10:1 

49C2 . +546 9:1 

49G12 +672 9:1 

66E10 +2884 12:1 



73E6 ' +2055 16:1 

73E9 +1987 12:1 

73F2 +2061 12:1 

110G8 +908 9:1 

141G4 +956 18:1 
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Gene/Function 



Strain Mutation location Biofilm Ratio 
Relative to imtion codon (Parent;Mutant) 



yecK -enzyme; 

cytochrome-C type 
protein 

yedK -protein - cellular 
respiration 

yjcC -protein modulates 
biofilm formation 

yjjQ -regulator modulates 
biofilm formation 



ykgK -regulator modulates 
biofilm formation 



62C9 +479 20:1 

130E8 -26 24:1 

37B5 +649 11:1 

27F9 -54 2:1 



141G2 +502 16:1 

160A8 +506 5:1 

145F10 +344 3:1 
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Table 3. Mutant strains with an increase in biofilm formation compared to parent TRMG F/M. 



Gene/Function 



Strain Mutation location Biofilm Ratio 
Relative to inition codon fParent: MutanD 



arcB - global regulatory function, 1 36E3 
respiration ; protein kinase/ 
phosphatase 

bl936 -protein; 77A5 
homologous to intG, 
a defective integrase gene 
in Escherich ia coli 0 1 5 7 : H7 

dnaKp2 promoter - Hsp chaperone; 64F2 
DNA synthesis, and motility 



flgB - flagellar biosynthesis; 
basal-body rod 



flgE - flagellar biosynthesis; 
hook Protein 

flgH- flagellar biosynthesis; 
ring protein 



115B7 
167C2 
14C10 

98E11 



flgl - flagellar biosynthesis; product 1 52B4 
homologous to basal body 
P-ring in Salmonella 

flgK - flagellar biosynthesis; hook- 1 1 1 G8 
filament junction protein 

flhA - flagellar biosynthesis; . 126 A3 
flagella regulon member 

126G2 



flhB - flagellar biosynthesis; 

flagella regulon member 



61G2 



+710 



+1057 



flhD - flagellar biosynthesis; 1 1 OF 12 

transcription initiation factor 



+12 

+117 
+102 
+462 

+11 

+1062 

+152 

+907 
+907 
-20 

-95 



1:7 



1:11 



1:9 

1:7 
1:8 
1:10 

1:5 

1:9 

1:4 

1:7 
1:7 
1:10 

1:9 
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Gene/Function 



Strain Mutation location Biofilm Ratio 
Relative to inition codon (Parent:Mutant) 



fliA - flagellar biosynthesis; 
Alternate sigma factor 

fliD - flagellar biosynthesis; 
filament capping protein 



78E3 



1B10 



115A3 



163E7 



163F3 



fliF, predicted promoter - flagellar 85G 1 1 
biosynthesis; basal body ring 



fliG - flagellar biosynthesis; 
motor component 



flil- flagellar biosynthesis; 
flagellum ATP synthase 



fliL - flagellar biosynthesis; 
surface structure 

fliM- flagellar biosynthesis; 
motor component 



43B10 

89A8 

67E10 

70G12 

122F6 

13G10 

38G7 

51B12 

66F4 

67C8 

67C9 

76F11 

132B8 



+594 

+27 
+494 
+27 
+27 
-5 

+664 

+802 

+747 

+1349 

+152 

+204 
+488 
+535 
+489 
+481 
+481 
+484 
+176 



1:7 

1:10 

1:2 

1:4 

1:8 

1:6 

1:11 

1:6 

1:6 

1:5 

1:4 

1:11 
1:11 
1:6 
1:10 
1:6 
1:8 
1:5 
1:7 
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Gene/Function 



Strain Mutation location Biofilm Ratio 
Relative to inition codon fParent:Mutant) 



fliP - flagellar biosynthesis; 
flagella regulon member 



fliR - flagellar biosynthesis; 
flagella regulon member 



frdA - enzyme; anaerobic 
respiration, fumarate 
reductase 



rep - enzyme; helicase, 

DNA-dependent ATPase 



42G6 

49G9 

57E7 

103C8 

110E8 

125C9 

98E7 

123E4 

45C8 



mreB - cell division; cell shape, 104G4 
regulator of peptidoglycan 
synthetase 

mdoG - enzyme; periplasmic 98G12 
oligosaccharide synthesis 



39C9 



rfaG - enzyme; LPS biosynthesis, 12E12 
glucosyltransferase I 

74B5 

rfaP - enzyme; LPS biosynthesis, 139G5 
hepatose phosphorylation 



+340 
+388 
+395 
+397 
+390 
+397 
+585 
+130 
+67 

-121 

+142 

+270 

+163 

+62 

+496 



1:12 

1:12 

1:7 

1:9 

1:6 

1:7 

1:6 

1:6 

1:4 

1:2 

1:5 

1:3 

1:7 
1:2 
1:5 
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Gene/Function 



Strain Mutation location Biofilm Ratio 
Relative to inition codon (Parent:Mutant) 



rfaQ - enzyme; LPS core 
biosynthesis 



rnhB - enzyme; degradation of , 
RNA, RNAseHII 



63A9 



71A4 



94G6 



107B12 



123B8 



125F2 



92G7 



92G9 



tolA - periplasmic protein; outer 78F6 
membrane integrity 

wecB - enzyme; UDP-N-acetyl 
glucosamine-2-epimerase, 
sugar nucleotide biosynthesis 

yojN- regulator; part of 

a phosphorelay signaling 
pathway; affects swarming 
behaviour 



43F5 



103E8 



+380 
-44 
-44 
-65 
+361 
+104 
+500 
+500 
+1189 

+1112 
+2147 



1:4 

1:2 

1:4 

1:2 

1:3 

1:3 

1:3 

1:3 

1:2 

1:2 
1:7 



yhjH -protein modulates 
biofilm formation 



36E2 



+678 



1:3 
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